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PLANT ACYLTRANSFERASES 
TECHNICAL FIELD 

The invention relates to alteration of oil content in plants. 

BACKGROUND 

5 Triacylglycerols produced in plant tissues (e.g., canola seeds) are a significant 

source of fatty acids in the human diet. Diets rich in animal fat appear to promote 
maladies such as heart disease in humans. The ability to modulate triacyl glycerol 
synthesis in plants could allow for production of fatty acid compositions that are more 
beneficial for humans or more efficiently synthesized by the plant. Cloning and 
10 characterization of plant genes encoding enzymes involved in triacylglycerol 
synthesis and metabolism represents a major advance toward controlling 
triacylglycerol synthesis in plants. 

SUMMARY 

A new plant acyltransferase gene from Brassica napus has been cloned and 
15 characterized. The protein encoded by this gene is involved in triacylglycerol 

synthesis. Partial DNA sequences of the B. napus gene confirm that it is related to an 
Arabidopsis thaliana putative acyltransferase gene. Antisense expression of the B. 
napus sequence decreases total oil content. 

In one aspect, the invention features an isolated nucleic acid having at least 
20 80% (e.g., 85%, 90%, or 99%) sequence identity to the nucleotide sequence of SEQ 
ID NO:3, or to a fragment of the nucleotide sequence of SEQ ID NO:3, the fragment 
being at least 15 nucleotides in length. An isolated nucleic acid having at least 80% 
(e.g., 85%, 90%, or 99%) sequence identity to the nucleotide sequence of SEQ ID 
NO:4, or to a fragment of the nucleotide sequence of SEQ ID NO:4, the fragment 
25 being at least 15 nucleotides in length is also featured. 

In another aspect, the invention features an isolated nucleic acid that includes 
a first and second region, the first region having at least 80% sequence identity to the 
nucleotide sequence of SEQ ID NO:3 and the second region having at least 80% 
sequence identity to the nucleotide sequence of SEQ ID NO:4. The nucleic acid can 
30 encode a diacylglycerol acyltransferase polypeptide. The first and second regions can 
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be separated by approximately 600 nucleotides. The nucleic acid can be the insert of 
pMB143. An expression vector including the nucleic acid operably linked to an 
expression control element also is featured. The nucleic acid can be operably linked 
in antisense orientation. 

5 In another aspect, the invention features a transgenic plant and progeny thereof 

that include an exogenous nucleic acid encoding a diacylglycerol acyltransferase 
polypeptide operably linked to a regulatory element, and seed produced by such 
plants. The nucleic acid can include a first and second region, the first region having 
at least 80% sequence identity to the nucleotide sequence of SEQ ID NO:3, the 
10 second region having at least 80% sequence identity to the nucleotide sequence of 
SEQ ID NO:4. The transgenic plant can produce seeds that exhibit a statistically 
significantly altered oil content as compared to seeds produced by a corresponding 
plant lacking the nucleic acid encoding the diacylglycerol acyltransferase polypeptide. 
The plant can be a soybean plant or a Brassica plant. 

15 The invention also features a nucleic acid that includes the nucleotide 

sequence of SEQ ID NO:3 or SEQ ID NO:4 or the nucleotide sequence exactly 
complementary to SEQ ID NO:3 or SEQ ID NO:4. The nucleic acid also can include 
the RNA equivalent of the nucleotide sequence of SEQ ID NO:3 or SEQ ID NO:4, or 
an RNA equivalent that is exactly complementary to the nucleotide sequence of SEQ 

20 ID NO:3 or SEQ ID NO:4. The nucleotide sequences of SEQ ID NO:3 and SEQ ID 
NO:4 are partial sequences of the new B. napus gene. 

In another aspect, the invention features a nucleic acid that (I ) hybridizes 
under stringent conditions to a DNA molecule consisting of the nucleotide sequence 
of SEQ ID NO:3 or SEQ ID NO:4, and (2) encodes a plant acyltransferase. In 

25 addition, the invention includes a nucleic acid (I ) having a nucleotide sequence which 
is at least 80% (e.g., at least 82, 85, 90, 92, 95, 98, or 99%) identical to the nucleotide 
sequence of SEQ ID NO:3 or SEQ ID NO:4, and (2) encoding a plant acyltransferase. 
As used herein, the term "stringent conditions" means hybridization at 42°C in the 
presence of 50% formamide; a first wash at 65°C with 2X SSC containing l% SDS; 

30 followed by a second wash at 65°C with 0.1 X SSC. 
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The nucleotide sequence fragments described below can be used to hybridize 
against cDNA or genomic DNA libraries from a variety of sources to clone genes 
related to the B. napus acyltransferase gene. In addition, the sequence fragments can 
be used to design additional primers for further sequencing of the B. napus gene or for 
5 PCR amplification of portions of the B. napus gene. 

Unless otherwise defined, all technical and scientific terms used herein have 
the same meaning as commonly understood by one of ordinary skill in the art to 
which this invention belongs. Although methods and materials similar or equivalent 
to those described herein can be used to practice the invention, suitable methods and 
10 materials are described below. All publications, patent applications, patents, and 

other references mentioned herein are incorporated by reference in their entirety. In 
case of conflict, the present specification, including definitions, will control. In 
addition, the materials, methods, and examples are illustrative only and not intended 
to be limiting. 

15 Other features and advantages of the invention will be apparent from the 

following detailed description, and from the claims. 

DESCRIPTION OF DRAWINGS 

Figure 1 is a schematic of a construct containing the acyltransferase. 
Restriction sites are marked. 

20 Figure 2 is a schematic of a construct containing the acyltransferase in 

antisense orientation. Restriction sites are marked. 

DETAILED DESCRIPTION 

Triacylglycerols (TAGs) are synthesized by the sequential transfer of acyl 
chains to the glycerol backbone by a series of enzymes in the Kennedy pathway 

25 (glycerol-3 -phosphate acyltransferase, lysophosphatidic acid acyltransferase, and 
diacylglycerol acyltransferase). See, Topfer et al., Science , 1995, 268:68 1-686. 
Diacylglycerol can be used to form TAGs or membrane glycerolipids, and is the 
substrate for diacylglycerol acyltransferase (DAG AT, E.C. 2.3.1 .20). DAGAT 
transfers a third acyl chain to diacylglycerol, forming a TAG, and is the only enzyme 

30 unique to TAG synthesis in the Kennedy pathway. Thus, the reaction catalyzed by 
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DAGAT represents a key branchpoint in TAG synthesis. As described herein, a 1 A 
kB gene that encodes DAGAT has been identified from Brass ica and can be used to 
alter total oil content in plants. 

DAGAT Nucleic Acid Molecules 

5 The invention features isolated nucleic acids having at least 80% sequence 

identity, e.g., 85%, 90%, 95%, or 99% sequence identity, to the nucleic acid of SEQ 
ID NO:3 or SEQ ID NO:4, or fragments of the nucleic acid of SEQ ID NO:3 or 4 that 
are at least about 1 5 nucleotides (nt) in length (e.g., at least 1 8, 20, 22, 24, 26, 28, or 
30 nt). In one embodiment, the nucleic acid includes a first region having at least 
10 80% sequence identity to the nucleotide sequence of SEQ ID NO: 3 and a second 
region having at least 80% sequence identity to the nucleotide sequence of SEQ ID 
NO:4. The first and second regions can be separated by about 590 to about 700 
nucleotides, e.g., about 600 nucleotides. 

Generally, percent sequence identity is calculated by determining the number 
15 of matched positions in aligned nucleic acid sequences, dividing the number of 

matched positions by the total number of aligned nucleotides, and multiplying by 100. 
A matched position refers to a position in which identical nucleotides occur at the 
same position in aligned nucleic acid sequences. The total number of aligned 
nucleotides refers to the number of nucleotides from SEQ ID NO:3 or NO:4 that are 
20 being aligned. Nucleic acid sequences can be aligned by the Clustal algorithm of 
MEGALIGN® (DNASTAR, Madison, WI, 1997) sequence alignment software. In 
this method, sequences are grouped into clusters by examining the distances between 
all pairs. Clusters are aligned as pairs, then as groups. A gap penalty of 100 and a 
gap length penalty of 2 are used in the alignments. 

25 Isolated nucleic acid molecules of the invention can be produced by standard 

techniques. As used herein, "isolated" refers to a sequence corresponding to part or 
all of a gene encoding a DAGAT polypeptide, but free of sequences that normally 
flank one or both sides of the wild-type gene in a plant genome. As used herein, 
"polypeptide" refers to a chain of least eight amino acids. An isolated nucleic acid 

30 can be, for example, a DNA molecule, provided one of the nucleic acid sequences 
normally found immediately flanking that DNA molecule in a naturally-occurring 
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genome is removed or absent. Thus, an isolated nucleic acid includes, without 
limitation, a DNA molecule that exists as a separate molecule (e.g., a cDNA or 
genomic DNA fragment produced by PCR or restriction endonuclease treatment) 
independent of other sequences as well as recombinant DNA that is incorporated into 
5 a vector, an autonomously replicating plasmid, a virus (e,g., a retrovirus, adenovirus, 
or herpes virus), or into the genomic DNA of a prokaryote or eukaryote. In addition, 
an isolated nucleic acid can include a recombinant DNA molecule that is part of a 
hybrid or fusion nucleic acid. A nucleic acid existing among hundreds to millions of 
other nucleic acids within, for example, cDNA libraries or genomic libraries, or gel 
10 slices containing a genomic DNA restriction digest, is not to be considered an isolated 
nucleic acid. 

isolated nucleic acid molecules are at least about 1 5 nucleotides in length. For 
example, the nucleic acid molecule can be about 20 to 35, 40-50, 50-100, or greater 
than 150 nucleotides in length, e.g., 200-300, 300-500, 500-1000, or 1 100-1500 

15 nucleotides in length. Such fragments, whether encoding a polypeptide or not, can be 
used as probes, primers, and diagnostic reagents. In some embodiments, the isolated 
nucleic acid molecules encode a full-length DAGAT polypeptide. Nucleic acid 
molecules can be DNA or RNA, linear or circular, and in sense or antisense 
orientation. The nucleic acid molecules also can be complementary to the nucleotide 

20 sequences of SEQ ID NO:3 or SEQ ID NO:4. A nucleic acid encoding a DAGAT 
polypeptide may or may not contain introns within the coding sequence. 

Polymerase chain reaction (PCR) techniques can be used to produce nucleic 
acid molecules of the invention. PCR refers to a procedure or technique in which 
target nucleic acids are amplified. Sequence information from the ends of the region 

25 of interest or beyond typically is employed to design oligonucleotide primers that are 
identical in sequence to opposite strands of the template to be amplified. PCR can be 
used to amplify specific sequences from DNA as well as RNA, including sequences 
from total genomic DNA or total cellular RNA. Primers are typically 14 to 40 
nucleotides in length, but can range from 10 nucleotides to hundreds of nucleotides in 

30 length. General PCR techniques are described, for example in PCR Primer: A 

Laboratory Manual, Ed. by Dieffenbach, C. and Dveksler, G., Cold Spring Harbor 
Laboratory Press, 1995. 
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Nucleic acids encoding DAGAT polypeptides also can be produced by 
chemical synthesis, either as a single nucleic acid molecule or as a series of 
oligonucleotides. For example, one or more pairs of long oligonucleotides (e.g., >100 
nucleotides) can be synthesized that contain the desired sequence, with each pair 
5 containing a short segment of complementarity (e.g., about 1 5 nucleotides) such that a 
duplex is formed when the oligonucleotide pair is annealed. DNA polymerase is used 
to extend the oligonucleotides, resulting in a double-stranded nucleic acid molecule 
per oligonucleotide pair, which then can be ligated into a vector. 

It should be appreciated that many different nucleic acids will encode a 
10 polypeptide having a particular DAGAT amino acid sequence. The degeneracy of the 
genetic code is well known in the art, i.e., many amino acids are encoded by more 
than one nucleotide codon. It should also be appreciated that certain amino acid 
substitutions can be made within polypeptide sequences without affecting the function 
of the polypeptide. Conservative amino acid substitutions or substitutions of similar 
15 amino acids often are tolerated without affecting polypeptide function. Similar amino 
acids can be those that are similar in size and/or charge properties. Similarity between 
amino acids has been assessed in the art. For example, Dayhoff et al. (1978) in Atlas 
of Protein Sequence and Structure, Vol. 5, Suppl. 3, pp. 345-352, provides frequency 
tables for amino acid substitutions that can be employed as a measure of amino acid 
20 similarity. 

Transgenic Plants 

The invention features transgenic plants that have altered total oil content, i.e., 
increased or decreased oil content. Suitable plant species include, for example, 
Brassica spp. such as B. napus, B. campestris, B.juncea, and B. rapa (canola-type 
25 and high erucic acid rapeseed), soybean, sunflower, castor bean, safflower, crambe, 
palm, coconut, corn, cottonseed, olive, peanut, flax, and sesame. Canola, soy, 
sunflower, and safflower plants having increased oil content are particularly useful. 
Table 1 provides relative percent oil and protein content on a dry weight basis (unless 
indicated otherwise) of suitable oilseed plants. 

30 The present invention describes a novel method of making transgenic plants 

that produce seeds with a statistically significant alteration in oil content. As used 
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herein, "statistically significant" refers to a p-value of less than 0.05, e.g., a p-value of 
less than 0.025 or a p-value of less than 0.01, using an appropriate measure of 
statistical significance, e.g., a one-tailed two sample t-test. By using this method, 
plants can be produced that exhibit an altered oil content in their seeds. The altered 
oil content is statistically significant relative to the oil content of unmodified seeds. 
Plants produced by the method of the present invention can produce seeds having an 
increase in oil of from about 1% to about 25% over the oil content in seeds produced 
by unmodified control plants. For example, the increase in oil content for plants 
described herein can be from about 2% to about 20%, from about 4% to about 15%, 
from about 5% to about 10%, or from about 10% to about 20%, relative to 
unmodified plants. 

TABLE 1 

Relative Percent Oil and Protein Content of Oilseed Plants 



Plant 



% Oil 



% Protein 



Key 



Soybean 
(Glycine max) 



-20 



-40 



Rapeseed 
(Brassica napus) 



40-44 



38-41 (oil 
free meal) 



C;D 



Sunflower 
(Helianthus annus) 



Castor bean 
(Ricinus communis) 



40 



D 



50 



Safflower 
(Carthamus tinctorius) 
Crambe 

{Cramhe abyssinica) 



Palm 

(Elaeis guineensis) 



36.8-47.7 



15.4-22.5 



D 



30-35 



-28 



B 



20 
>50 



C; per fresh fruit bunch (-20% 

moisture); 

Dried kernels 



Coconut 

(Cocos nucifera) 



Maize 
(Zea mays) 



Cottonseed 
(Gossypium hirsutum) 



Olive 
{Olea europaea) 



Peanut 
(Arachis hypogaea) 



34 
69 



3.5 



D; coconut flesh (50% moisture); 
dried kernels 



3.1-5.7 



6-12 



C;D 



25-30 



25-30 



D; kernel 



19.6 



1.6 



fruit (52.4% moisture) 



36-56 



25-30 



C; (unknown moisture) 



Flax 

(Linum usiiatissimum) 



35-45 



D; per fruit capsule (~10 
seeds/fruit) 
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Sesame 


53.3-57.5 


25-30 


D; (5-7% moisture) 


(Sesamum indicum) 







(A) Brigham RD, 1993, Castor: Return of an old crop, p 380-3. In New Crops, 
Janick, J & Simon, JE, eds. Wiley, NY. 

(B) Grombacher et al., Cooperative Extension, Institute of Agriculture and 

5 Natural Resources, University of Nebraska-Lincoln, Crambe production, Publication 
G93-1 126A, Gl 126 (Field Crops), F-l 7 (Misc. Crops); see also pubs@unlvm.unl.edu 

(C) In Principles of Cultivar Development, 1987, Fehr, WR, ed., Macmillan 
Publishing Co., NY. 

(D) In 5 th Edition Bailey's Industrial Oil & Fat Products, Vol. 2, Edible Oil 
10 & Fat Products: Oils and Oil Seeds, 1996, Hui, YH, ed., Wiley, NY. 

A plant described herein may be used as a parent to develop a plant line, or 

may itself be a member of a plant line, i.e., it is one of a group of plants that display 

little or no genetic variation between individuals for total oil content. Such lines can 

be created by several generations of self-pollination and selection, or vegetative 

15 propagation from a single parent using tissue or cell culture techniques known in the 

art. Additional means of breeding plant lines from a parent plant are known in the art. 

In general, plants of the invention can be obtained by introducing at least one 
exogenous nucleic acid encoding a DAGAT polypeptide into plant cells. As used 
herein, the term "exogenous" refers to a nucleic acid that is introduced into the plant 

20 Exogenous nucleic acids include those that naturally occur in the plant and have been 
introduced to provide one or more additional copies, as well as nucleic acids that do 
not naturally occur in the plant. Typically, a nucleic acid construct containing a 
nucleic acid encoding a DAGAT polypeptide is introduced into a plant cell. Seeds 
produced by a transgenic plant can be grown and selfed (or outcrossed and selfed) to 

25 obtain plants homozygous for the construct. Seeds can be analyzed to identify those 
homozygotes having the desired expression of the construct. Transgenic plants can be 
entered into a breeding program, e.g., to increase seed, to introgress the novel 
construct into other lines or species, or for further selection of other desirable traits. 
Alternatively, transgenic plants can be obtained by vegetative propagation of a 

30 transformed plant cell, for those species amenable to such techniques. 

Progeny of a transgenic plant are included within the scope of the invention, 
provided that such progeny exhibit altered oil content. Progeny of an instant plant 
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include, for example, seeds formed on Fj, F 2 , F 3 , and subsequent generation plants, or 
seeds formed on BQ, BC 2 , BC3, and subsequent generation plants. 

Transgenic techniques for use in the invention include, without limitation, 
Agro bacterium-mediated transformation, electroporation, and particle gun 
5 transformation. Illustrative examples of transformation techniques are described in 
WO 99/43202 and U.S. Patent 5,204,253 (particle gun) and U.S. Patent 5,188,958 
(Agrobacterium). Transformation methods utilizing the Ti and Ri plasm ids of 
Agrobacterium spp. typically use binary type vectors. Walkerpeach, C. et al., in Plant 
Molecular Biology Manual, S. Gelvin and R. Schilperoort, eds., Kluwer Dordrecht, 
10 CI :1-19 (1994). If cell or tissue cultures are used as the recipient tissue for 

transformation, plants can be regenerated from transformed cultures by techniques 
known to those skilled in the art. In addition, various plant species can be 
transformed using the pollen tube pathway technique. 

Nucleic acid constructs suitable for producing transgenic plants of the 
15 invention include a nucleic acid encoding a DAG AT polypeptide operably linked to a 
regulatory element such as a promoter. Standard molecular biology techniques can be 
used to generate nucleic acid constructs. To increase oil content in plants, the nucleic 
acid encoding a DAGAT polypeptide is operably linked to the regulatory element in 
sense orientation. 

20 Suitable promoters can be constitutive or inducible, and can be seed-specific. 

As used herein, "constitutive promoter" refers to a promoter that facilitates the 
expression of a nucleic acid molecule without significant tissue- or temporal- 
specificity. An inducible promoter may be considered to be a "constitutive 
promoter", provided that once induced, expression of the nucleic acid molecule is 

25 relatively constant or uniform without significant tissue- or temporal-specificity. 
Suitable promoters are known (e.g., Weising et aL, Ann. Rev. Genetics 22:421-478 
(1988)). The following are representative examples of promoters suitable for use 
herein: regulatory sequences from fatty acid desaturase genes (e.g., Brassica fad2D or 
fad2F, see WO 00/07430); alcohol dehydrogenase promoter from corn; light 

30 inducible promoters such as the ribulose bisphosphate carboxylase (Rubisco) small 
subunit gene promoters from a variety of species; major chlorophyll a/b binding 
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protein gene promoters; the 19S promoter of cauliflower mosaic virus (CaMV); as 
well as synthetic or other natural promoters that are either inducible or constitutive. 
In one embodiment, regulatory sequences are seed-specific, i.e., the particular gene 
product is preferentially expressed in developing seeds and expressed at low levels or 
5 not at all in the remaining tissues of the plant. Non-limiting examples of seed-specific 
promoters include napin, phaseolin, oleosin, and cruciferin promoters. 

Additional regulatory elements may be useful in the nucleic acid constructs of 
the present invention, including, but not limited to, polyadenylation sequences, 
enhancers, introns, and the like. Such elements may not be necessary for expression 

10 of a DA GAT polypeptide, although they may increase expression by affecting 
transcription, stability of the mRNA, translational efficiency, or the like. Such 
elements can be included in a nucleic acid construct as desired to obtain optimal 
expression of the acyltransferase nucleic acid in the host cell(s). Sufficient 
expression, however, may sometimes be obtained without such additional elements. 

15 A reference describing specific regulatory elements is Weising et ai, Ann. Rev. 
Genetics 22:421-478(1988). 

In some situations, a decreased oil content may be desired. A feature of the 
invention is that DAGAT activity can be reduced by gene silencing, antisense, 
ribozymes, cosuppression, or mutagenesis techniques, resulting in a decrease in oil 
20 content. Gene silencing techniques, such as that described in WO 98/36083 are 

useful. Antisense RNA has been used to inhibit plant target genes in a tissue-specific 
manner. See, for example, U.S. Patent Nos. 5,453,566, 5,356,799, and 5,530,192. 
Antisense nucleic acid constructs include a partial or a full-length coding sequence 
operably linked to at least one suitable regulatory sequence in antisense orientation. 

25 Expression of DAGAT also can be inhibited by ribozyme molecules designed 

to cleave DAGAT mRNA transcripts. While various ribozymes that cleave mRNA at 
site-specific recognition sequences can be used to destroy DAGAT mRNAs, 
hammerhead ribozymes are particularly useful. Hammerhead ribozymes cleave 
mRNAs at locations dictated by flanking regions that form complementary base pairs 

30 with the target mRNA. The sole requirement is that the target RNA contain a 5*-UG- 
3 1 nucleotide sequence. The construction and production of hammerhead ribozymes is 
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well known in the art. See, for example, U.S. Patent No. 5,254,678. Hammerhead 
ribozyme sequences can be embedded in a stable RNA such as a transfer RNA 
(tRNA) to increase cleavage efficiency in vivo. Perriman, R. et a!., Proc. Natl. Acad. 
Sci. USA, 92(1 3):61 75-6 179 (1995); de Feyter, R. and Gaudron, J., Methods in 
5 Molecular Biology, Vol. 74, Chapter 43, "Expressing Ribozymes in Plants", Edited by 
Turner, P.C, Humana Press Inc., Totowa, NJ (1 997). RNA endoribonucleases such as 
the one that occurs naturally in Tetrahymena thermophila, and which have been 
described extensively by Cech and collaborators also are useful. See, for example, 
U.S. Patent No. 4,987,071 . 

10 The phenomenon of co-suppression also has been used to inhibit plant target 

genes in a tissue-specific manner. Co-suppression of an endogenous gene using a 
full-length cDNA sequence as well as a partial cDNA sequence are known. See, for 
example, WO 94/1 1516, and U.S. Patent Nos. 5,451,514 and 5,283,124. Co- 
suppression of DAGAT activity in plants can be achieved by expressing, in the sense 

15 orientation, the entire or partial coding sequence of a DAGAT gene. 

Mutagenesis can also be used to reduce acytltransferase activity in plants. 
Mutagenic agents can be used to induce random genetic mutations within a population 
of seeds or regenerable plant tissue. Suitable mutagenic agents include, for example, 
ethyl methyl sulfonate, methyl N-nitrosoguanidine, ethidium bromide, diepoxybutane, 

20 x-rays, UV rays, and other mutagens known in the art. The treated population, or a 
subsequent generation of that population, is screened for reduced oil content or 
reduced DAGAT activity that results from the mutation. Mutations can be in any 
portion of a gene, including the coding region, introns, and regulatory elements, that 
render the resulting gene product non-functional or with reduced activity. Suitable 

25 types of mutations include, for example, insertions or deletions of nucleotides, and 
transitions or transversions in the wild-type coding sequence. Such mutations can 
lead to deletion or insertion of amino acids, and conservative or non-conservative 
amino acid substitutions in the corresponding gene product. 

Characterization of Oils 

30 Techniques that are routinely practiced in the art can. be used to extract, 

process, and analyze the oils produced by plants of the instant invention. Typically, 
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plant seeds are cooked, pressed, and extracted to produce crude oil, which is then 
degummed, refined, bleached, and deodorized. Generally, techniques for crushing 
seed are known in the art. For example, soybean seeds can be tempered by spraying 
them with water to raise the moisture content to, e.g., 8.5%, and flaked using a 
5 smooth roller with a gap setting of 0.23 to 0.27 mm. Depending on the type of seed, 
water may not be added prior to crushing. Application of heat deactivates enzymes, 
facilitates further cell rupturing, coalesces the oil droplets, and agglomerates protein 
particles, all of which facilitate the extraction process. 

The majority of the seed oil can be released by passage through a screw press. 

10 Cakes expelled from the screw press are then solvent extracted, e.g., with hexane, 
using a heat traced column. Alternatively, crude oil produced by the pressing 
operation can be passed through a settling tank with a slotted wire drainage top to 
remove the solids that are expressed with the oil during the pressing operation. The 
clarified oil can be passed through a plate and frame filter to remove any remaining 

15 fine solid particles. If desired, the oil recovered from the extraction process can be 
combined with the clarified oil to produce a blended crude oil. 

Once the solvent is stripped from the crude oil, the pressed and extracted 
portions are combined and subjected to normal oil processing procedures (i.e., 
degumming, caustic refining, bleaching, and deodorization). Degumming can be 
20 performed by addition of concentrated phosphoric acid to the crude oil to convert 
non-hydratable phosphatides to a hydratable form, and to chelate minor metals that 
are present. Gum is separated from the oil by centrifugation. The oil can be refined 
by addition of a sufficient amount of a sodium hydroxide solution to titrate all of the 
fatty acids and removing the soaps thus formed. 

25 Deodorization can be performed by heating the oil to 500°F (260°C) under 

vacuum, and slowly introducing steam into the oil at a rate of about 0. 1 
ml/minute/100 ml of oil. After about 30 minutes of sparging, the oil is allowed to 
cool under vacuum. The oil is typically transferred to a glass container and flushed 
with argon before being stored under refrigeration. If the amount of oil is limited, the 

30 oil can be placed under vacuum, e.g., in a Parr reactor and heated to 500°F for the 
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same length of time that it would have been deodorized. This treatment improves the 
color of the oil and removes a majority of the volatile substances. 

Oil content can be measured by NMR using AOCS Method AM 2-93 and 
AOCS Recommended Practice AK 4-95 or by near infra-red reflectance spectroscopy 
5 (NIR) using AOCS Method AK 3-94 and AOCS Procedure AM 1-92. Oil 

composition can be analyzed by extracting fatty acids from bulk seed samples (e.g., at 
least 10 seeds). Fatty acid TAGs in the seed are hydrolyzed and converted to fatty 
acid methyl esters. Percentages of fatty acids typically are designated on a weight 
basis and refer to the percentage of the tatty acid methyl ester in comparison with the 

10 total fatty acid methyl esters in the sample being analyzed. Seeds having an altered 
fatty acid composition may be identified by techniques known to the skilled artisan, 
e.g., gas-liquid chromatography (GLC) analysis of a bulked seed sample, a single seed 
or a single half-seed. Half-seed analysis is well known in the art to be useful because 
the viability of the embryo is maintained and thus those seeds having what appears to 

15 be a desired fatty acid profile may be planted to form the next generation. However, 
bulk seed analysis typically yields a more accurate representation of the fatty acid 
profile of a given genotype. Fatty acid composition can also be determined on larger 
samples, e.g., oil obtained by pilot plant or commercial scale refining, bleaching and 
deodorizing of endogenous oil in the seeds. 

20 The following examples are to be construed as merely illustrative of how one 

skilled in the art can make and use the DAGAT gene fragments, and does not limit the 
scope of the invention described in the claims. 

EXAMPLES 

Example 1 — Cloning of a Brassica Acyl Transferase: A mouse acyl 
25 CoA:diacylglycerol acyltransferase (DAGAT) gene was recently identified (Case et 
al., Proc Natl Acad Sci USA 95:13018-13023, 1998). The deduced mouse DAGAT 
amino acid sequence was used to search for similar plant sequences in GenBank and 
dbEST databases. The mouse DAGAT protein sequence aligned with significant 
sequence identity and similarity to a putative Arabidopsis thaliana acyl- 
30 CoAxholesterol acyltransferase (ACAT; GenBank Accession No. 3135276, locus 
ATAC003058). However, upon aligning the A thaliana ACAT with the mouse 
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DAGAT and mouse ACAT (Case et al. 5 J Biol Chem 273:26755-26764, 1998) 
sequences, it was found that the A. thaliana protein had greater homology to mouse 
DAGAT than to mouse ACAT. Thus, the type of acyltransferase encoded by the A. 
thaliana gene was unclear. 

5 The genomic DNA sequence encoding the A. thaliana protein (GenBank 

Accession No. 3135276) was used to design two PGR primers for amplifying 
candidate DAGAT or ACAT genomic sequences from different sources. The 5 1 
DAGAT- 1 primer had the sequence 

caucaucaucauACTGCCATGGACAGGTGTGATTCTGCTTTTT TATCA (SEQ ID 
10 NO:l), and the 3' DAGATD2 primer had the sequence 

cuacuacuacuaCTAGAGACAGGGCAATGTAGAAAGTATGTA (SEQ ID NO:2). 
Lowercase sequences were used for cloning into the pAMPl vector (Gibco, BRL). 

PCR amplification using DAGAT- 1 and DAGAT-2 primers was carried out as 
follows. Each 100|il PCR reaction mixture contained 50ng of genomic DNA, 200jiM 

15 of each dNTP, IX buffer B (Gibco BRL), 1 jxM DAGAT- 1 primer, 1 ^iM DAGAT-2 
primer, 3mM magnesium sulfate, and 2jjJ Elongase enzyme (Gibco BRL). The 
reaction mixture was denatured at 94°C for 3 minutes, followed by 30 cycles of 
denaturation at 94°C for 1 minute, annealing at 50°C for 2 minutes, and extension at 
72°C for 3 minutes. A final extension incubation was performed at 72°C for 10 

20 minutes after cycling. Based on the sequence of A, thaliana genomic DNA in 

GenBank Accession No. 3 135276, a 1369 bp fragment was expected to be amplified 
from Brassica. 

An approximately 1 .4 kb fragment was amplified from genomic DNA isolated 
from Brassica napus variety Westar under PCR conditions described above. The 
25 amplified DNA fragment was cloned into the pAMPl vector (Gibco BRL) and 

partially sequenced from the 5' end using a T7 universal primer and from the 3' end 
using a Sp6 universal primer. Figure 1 contains a restriction map of the pAMPl 
vector containing the putative DAGAT fragment, which has been designated 
pMB143. The partial sequence of the 5' end using the T7 primer was 
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ATGGACAGGTGTGATTCTGCTTTTTTATCAGGTGTCACGTTGATGCTCCTC 
ACTTGCATTGTGTGGCTGAAGTTGGTTTCTTACGCTCATACTAACTATGAC 
ATAAGAACCCTAGCTAATTCATCTGATAAGGTAAAAGAAGTGATATAATA 
TTGGTCACTTGCATTGTGTTTTACTATTTTGACCAGACACTGTTGAAAACT 
5 GTAGGCCAATCCTGAAGTCTCCTACTATGTTAGCTTGAAGAGCTTGGCGTA 
TTTCATGCTTGCTCCCACATTGTGTTATCAGGTAATCTGATGCGTCTTCTGC 
TAATTGTATCATACATTATCTTTCACTTGCAAAAGTTTCTTGTCTAAAACCT 
TGCGTCTTCGCTTTACCCAGCCGAGCTATCCACGTTCT (SEQ ID NO:3). 

The partial sequence of the 3' end using the Sp6 primer was 

1 o ATCAATCTTGTCTTACTCA A AAATCATATTATGTTTACGTTANTAACCAAA 
ATTCATGTACGCACTGTCTACCTTTGTCAGTATTGGAGAATGTGGAATATG 
GTATGGTTCTCTTCTTGAACATCCCCTTCTTTTTTTATACAAAGCAGATTAA 
GAAAAGCTTATTGAGATCTTGTTTTTTCTAATAGCCTGTTCATAAATGGAT 
GGTTCGACATGTATACTTTCCGTGCCTTCGCAGAAATATACCGAAAGTGA 

1 5 GTGTAGTTAATTGCG ATG ATCG ATATTTTTTTCTGTGCTTC ATAAATTTA AC 
CCTCCACTCATTTTTTTCCAGGTACCCGCTATTATCCTTGCTTTCTTAGTCT 
CTGCAGTCTTTCATGAGGTATAATACATACTTTCTACATTGNCCCTGTCTC 
(SEQ ID NO:4). 

The sequence of the B. napus clone obtained using the T7 primer had an 
20 overall identity of 76.5% with the corresponding region of the A. thaliana gene, 

suggesting that the 1 .4 kb cloned B. napus fragment encoded a protein that may be 
related to the A. thaliana DAGAT or ACAT. 

Plasmid pMB143 will be deposited with the American Type Culture 
Collection (ATCC), 10801 University Blvd., Manassas, VA, 201 10-2209. 

25 Example 2 - Reduction of Oil Content in Brassica Plants Expressing 

Antisense DAGAT: The putative DAGAT genomic DNA cloned in pAMPl was 
excised with a Smal/SnaBl digestion. The putative DAGAT fragment was purified 
and inserted at the blunted, EcoRI site of pMBl 10 to generate pMBl 71. This 
construct contains cruciferin promoter and cruciferin termination sequences. The 

30 antisense orientation of the putative DAGAT insert in pMB 1 7 1 was confirmed with 
restriction analysis. The cruciferin promoter/putative DAGAT/cruciferin termination 
cassette was released from pMB171 by partially digesting with Xbal and completely 
digesting with Xhol, and cloned into the Xhol/Xbal sites of pMOG800 to generate 
pMB 1 70. Figure 2 provides a diagram of pMB 1 70. 

35 The construct pMB170 was used to transform Agrobacterium LBA4404. The 

resulting Agrobacterium transformants were each co-cultivated separately with B. 
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napus variety Westar hypocotyls and cultured consecutively on incubation, selection 
(containing kanamycin) and regeneration media until green shoots were produced. 
Regenerated plantlets were transferred to the greenhouse and grown to maturity. 
Each Tl plant (N=56) was selfed and the resulting T2 seeds were harvested from each 
5 individual Tl plant 

Oil, protein, chlorophyll, glucosinolate, oleic acid (1 8:1), linoleic acid (18:2), 
and a-linolenic acid (18:3) content were determined in the T2 seed samples. Oil 
content was measured by NIR using a Foss NIR Systems model 6500 Feed and 
Forage Analyzer (Foss North America, Eden Prairie, MN) calibrated according to 

10 manufacturer's recommendations. Canola seed samples, which represented wide 
ranges of the sample constituents listed above, were collected for calibration. Lab 
analysis results were determined using accepted methodology (/.<?., oil, AOCS Method 
Ak 3-94; moisture, AOCS Method Ai 2-75; fatty acid, AOCS Method CE le-91 and 
AOCS Method CE 2-66; chlorophyll, AOCS Method CC 13D-55; protein, AOCS 

15 Method BA 4e-93; and giucosinolates, AOCS Method Ak 1-92). Instrument response 
also was measured for each sample. A calibration equation was calculated for each 
constituent by means of chemometrics. These equations are combined into one 
computer file and are used for prediction of the constituents contained in unknown 
canola samples. 

20 T2 seed samples containing unknown levels of the above constituents were 

prepared by removing foreign material from the sample. Cleaned whole seed was 
placed into the instrument sample cell and the cell was placed into the instrument 
sample assembly. Analysis was carried out according to manufacturer instructions 
and was based on AOCS Procedure Am 1-92. The results are predicted and reported 

25 as % constituent (% oil and protein are reported based on dry weight). Conversion 
from 'dry weight' basis to 'as is' basis for oil and protein can be calculated using the 
following formula: 

constituent (as is) = constituent (dry wt.) x [l-(% moisture/ 100)]. 

The average dried oil content of the T2 seeds (39.62±1 .95%, n=56) was not 
30 significantly different from that of the control (Westar, 39.92±1 .63, n=5). There were 
ten Tl plants, however, that produced T2 seeds having a lower total oil content. In 
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these T2 seeds, the average oil content was 36.63^0.64%. Table 2 provides an 
analysis of eight of these 1 0 T2 seeds. 

Approximately 10-20 T2 seeds from each of the ten selected plants were 
planted, and a nickel size portion of the leaf tissue was taken from plants about 2.5 
5 weeks post-germination. Tissue samples were dried in a food dehydrator at 1 35°C for 
8-16 hours. DNA was isolated using the Qiagen Dneasy96 Plant Kit, and 
resuspended in 150|il of buffer. PCR amplification was performed in a 20jjJ volume 
containing IX PCR buffer containing 1.5mM MgCl 2 (Qiagen PCR Core Kit), 0.2mM 
dNTP, 0.5 units Taq polymerase (Qiagen), 0.5^M cruciferen primer (S'-CTT TAT 

10 GGA TGA GCT TGA TTG AG-3% SEQ ID NO:5), and 0.5nM acyltransferase primer 
(5'-CCG CTC TAG AGG GAT CCA AGC-3 5 , SEQ ID NO:6), 0.4% sucrose, and 
0.008% cresol. Amplification conditions included 30 cycles of denaturation at 94°C 
for 30 seconds, annealing at 55°C for 30 seconds, and extension at 72°C for I minute. 
PCR products were analyzed 1.2% agarose gel electrophoresis. As indicated in Table 

15 1, the selected plants tested positive for the transgene. 

TABLE 2 

Analysis of T2 Seeds With Lower Oil Content 

SAMPLE OIL PROT CHLOR GLUC C18:l C18:2 C18:3 (# pos. 



plants/Total 
# plants 
tested) 


Y350071 


35.69 


26.07 


78.12 


3.66 


70.57 


10.03 


7.74 


15/18 


Y3 50070 


36.35 


24.87 


78.57 


2.60 


72.02 


8.18 


7.34 


11/18 


Y3 50095 


36.73 


26.55 


88.44 


1.38 


73.12 


7.80 


7.62 


7/9 


Y3 50079 


36.79 


27.96 


22.75 


2.96 


66.67 


14.97 


7.53 


17/18 


Y3 50052 


37.02 


26.02 


74.34 


3.44 


67.80 


14.09 


7.62 


8/18 


Y3 50077 


37.04 


25.70 


70.80 


2.91 


71.30 


9.16 


8.07 


15/18 


Y350089 


37.13 


25.07 


59.60 


2.28 


71.37 


9.93 


7.26 


8/15 


Y350078 


37.66 


25.64 


43.00 


2.35 


74.75 


7.48 


6.94 


1 1/18 


Average 


36.80 


25.99 


64.45 


2.70 


70.95 


10.23 


7.51 




Std Dev. 


0.58 


0.97 


21.79 


0.72 


2.65 


2.81 


0.34 





- 17- 



_006674SA1J_> 



WO 00/66749 



PCT/US00/12316 



OTHER EMBODIMENTS 

It is to be understood that while the invention has been described in 
conjunction with the detailed description thereof, the foregoing description is intended 
to illustrate and not limit the scope of the invention, which is defined by the scope of 
5 the appended claims. Other aspects, advantages, and modifications are within the 
scope of the following claims. 
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WHAT IS CLAIMED IS: 

I . An isolated nucleic acid having at least 80% sequence identity to the 
nucleotide sequence of SEQ ID NO:3 5 or to a fragment of the nucleotide sequence of 
SEQ ID NO:3, said fragment being at least 1 5 nucleotides in length. 

5 2. The nucleic acid of claim 1, said nucleic acid having at least 85% 

sequence identity to the nucleotide sequence of SEQ ID NO:3 or to said fragment. 

3. The nucleic acid of claim 1, said nucleic acid having at least 90% 
sequence identity to the nucleotide sequence of SEQ ID NO:3 or to said fragment. 

4. The nucleic acid of claim 1, said nucleic acid having at least 95% 
10 sequence identity to the nucleotide sequence of SEQ ID NO:3 or to said fragment. 

5. An isolated nucleic acid having at least 80% sequence identity to the 
nucleotide sequence of SEQ ID NO:4, or to a fragment of the nucleotide sequence of 
SEQ ID NO:4, said fragment being at least 15 nucleotides in length. 

6. The nucleic acid of claim 5, said nucleic acid having at least 85% 
15 sequence identity to the nucleotide sequence of SEQ ID NO:4 or to said fragment. 

7. The nucleic acid of claim 5, said nucleic acid having at least 90% 
sequence identity to the nucleotide sequence of SEQ ID NO:4 or to said fragment. 

8. The nucleic acid of claim 5, said nucleic acid having at least 95% 
sequence identity to the nucleotide sequence of SEQ ID NO:4 or to said fragment. 

20 9. An isolated nucleic acid comprising a first and second region, said first 

region having at least 80% sequence identity to the nucleotide sequence of SEQ ID 
NO:3, said second region having at least 80% sequence identity to the nucleotide 
sequence of SEQ ID NO:4. 

10. The nucleic acid of claim 9, wherein said nucleic acid encodes a 
25 diacylglycerol acyltransferase polypeptide. 

II. The nucleic acid of claim 9, wherein said first and second regions are 
separated by approximately 600 nucleotides. 

12. The nucleic acid of claim 9, wherein said nucleic acid is the insert of 
pMB143. 
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1 3. An expression vector comprising the nucleic acid of claim 9 operably 
linked to an expression control element. 

14. The expression vector of claim 13, wherein said nucleic acid is 
operably linked in antisense orientation. 

5 15. A transgenic plant comprising an exogenous nucleic acid encoding a 

diacylglycerol acyltransferase polypeptide operably linked to a regulatory element 

1 6. The transgenic plant of claim 15, said nucleic acid comprising a first 
and second region, said first region having at least 80% sequence identity to the 
nucleotide sequence of SEQ ID NO:3, said second region having at least 80% 

10 sequence identity to the nucleotide sequence of SEQ ID NO:4. 

17. The transgenic plant of claim 15, wherein said plant produces seeds 
that exhibit a statistically significantly altered oil content as compared to seeds 
produced by a corresponding plant lacking said nucleic acid encoding said 
diacylglycerol acyltransferase polypeptide. 

15 18. The plant of claim 15, wherein said plant is a soybean plant 



19. 



The plant of claim 15, wherein said plant is a Brassica plant. 



20. 



Seeds produced by the plant of claim 15. 



21. 



Progeny of the plant of claim 1 5. 
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pMB143 



PflMl 4518 
Dram 4399, 
AlwNI 4315 

AccI 4104 

Sail 4103 
Smal 4100 
Aval 4098 
PspAI 4098 
Xmal 4098 
EcoRI 4093 
BspMH 4089 
RsrO 4087 
KpnT 4085 
Agel 4084 
Acc65I 4081 
Asp718 4081 
PstI 4080 
BspMI 4069 
Bbel 3808 
Ehel 3806 
Narl 3805 
KasI 3804 
BsmBI 3780 
Hpal 3671 
EcoRV 3615 

Apal 3376 
Bsp120I 3372 
BstEQ 3346 



Tth111I 5511 
PstI 5472 
Kpnl 5440 
Acc65I 5436 
Asp718 5436 
Clal 5383 
Bst1107l 5326 
Xcal 5326 
AccI 5325 
BglO 5266 
HinDIH 5266 
AccI 5176 



XmaBI 15 
EagI 1 5 
NotI 15 

Xbal 22 
BamHI 28 
HinDDI 34 
SplI 41 
BsiWI 41 
SnaBI 41 
Mlul 44 
SphI 54 
Aatn 60 




NgoMl 378 
Nael 380 
Dram 486 

XmnI 1027 
Bsp 12861 1044 
Seal 1146 

EcoNl 1 266 



Bsal 1561 
Ahdl 1 627 



AlwNI 2106 



FIG. 1 



SUBSTITUTE SHEET (RULE 26) 



WO 00/66749 PCT/US00/12316 



2/2 



pMB170 

Mlul 12978 

BstED 12874 
PstI 1 2727 
PvuO 12622 
BstXI 1 2544 



Sfil 10911 
PflMI 10582 



Muni 9950 




Nxul 9071 
Bgin 9021 
Ahdl 8826 
PvuII 8404 



Apal 7364 
Bsp120I 7360 



EcoNI 1628 

Bgin 2345 
SphI 2372 
Pmel 2493 
Xhol 2537 
PaeR7I 2537 
PvuH 2545 
SphI 2559 
PstI 2565 
Sail 2567 

BsrGI 3090 
SplI 3536 
BsiWI 3536 

PstI 3693 
BamHI 3704 
Xbal 3710 
NotI 3717 
PstI 3786 
Kpnl 3818 
Clal 3869 
Xcal 3928 
Bst1107I 3928 
Bgin 3974 
Sail 5147 
BamHI 5594 
Sail 5931 
Xbal 5941 
BamHI 5948 
Xmal 5955 
PspAI 5955 
Smal 5957 
Kpnl 5966 
EcoRI 5969 
Bsu36I 6161 
Nhel 6176 
PstI 6480 
PvuII 6533 
SphI 6832 
BstBl7109 
BsiCI7109 
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